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Abstract 

We  address  discrete-time  pursuit-evasion  games  in  the  plane  where  every  player  has  identical  sensing  and  motion  ranges 
restricted  to  closed  discs  of  given  sensing  and  stepping  radii.  A  single  evader  is  initially  located  inside  a  bounded  subset  of  the 
environment  and  does  not  move  until  detected.  We  propose  a  Sweep-Pursuit-Capture  pursuer  strategy  to  capture  the  evader  and 
apply  it  to  two  variants  of  the  game:  the  first  involves  a  single  pursuer  and  an  evader  in  a  bounded  convex  environment  and  the 
second  involves  multiple  pursuers  and  an  evader  in  a  boundaryless  environment.  In  the  first  game,  we  give  a  sufficient  condition  on 
the  ratio  of  sensing  to  stepping  radius  of  the  players  that  guarantees  capture.  In  the  second,  we  determine  the  minimum  probability 
of  capture,  which  is  a  function  of  a  novel  pursuer  formation  and  independent  of  the  initial  evader  location.  The  Sweep  and  Pursuit 
phases  reduce  both  games  to  previously-studied  problems  with  unlimited  range  sensing,  and  capture  is  achieved  using  available 
strategies.  We  obtain  novel  upper  bounds  on  the  capture  time  and  present  simulation  studies  that  address  the  performance  of 
the  strategies  under  sensing  errors,  different  ratios  of  sensing  to  stepping  radius,  greater  evader  speed  and  different  number  of 
pursuers. 
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On  Discrete-time  Pursuit-evasion  Games 
with  Sensing  Limitations 


I.  Introduction 

The  game  of  pursuit  can  be  posed  as  to  determine  a  strategy 
for  a  pursuer  (a  team  of  pursuers)  to  capture  an  evader  in  a 
given  environment.  By  capture ,  we  mean  that  the  evader  and 
the  pursuer  (some  pursuer)  meet  at  the  same  location  after  a 
finite  time.  The  aim  of  the  pursuer  (pursuers)  is  to  capture  the 
evader  for  any  evader  trajectory.  The  evader  wins  the  game  if 
it  can  avoid  capture  indefinitely.  All  the  players  have  identical 
motion  capabilities.  Capture  strategies  are  important  in  surveil¬ 
lance  where  we  would  like  to  detect  and  capture  equally  agile 
intruders.  Another  application  is  search-and-rescue  operations 
where  a  worst-case  capture  strategy  guarantees  a  rescue,  in 
spite  of  any  unpredictable  motion  of  the  victim. 

The  continuous  time  version  of  this  game  has  been  studied 
in  [1],  [2]  and  [3]  to  cite  a  few.  Recently,  the  discrete-time 
version  of  the  game  has  received  significant  attention.  [4] 
describes  sufficient  conditions  and  a  strategy  for  a  single 
pursuer  to  capture  an  evader  in  a  semi-open  environment. 
This  strategy  has  been  extended  in  [5]  to  the  case  of  multiple 
pursuers  in  an  unbounded  environment,  to  capture  a  single 
evader  which  is  inside  their  convex  hull.  [6]  and  [7]  describe 
pursuer  strategies  of  moving  towards,  and  towards  the  previous 
positions  of  the  evader  respectively,  so  that  the  distance  to 
the  evader  is  reduced  to  a  finite,  non-zero  amount  in  finite 
time.  The  game  has  also  been  studied  in  different  types  of 
bounded  environments,  e.g.,  circular  environment  [6],  curved 
environments  [8],  Visibility -based  pursuit  evasion  has  been 
studied  in  a  continuous-time  setting  in  [9]  and  in  polygonal 
environments  in  a  discrete-time  setting  in  [10]. 

In  the  context  of  sensing  limitations,  in  continuous  time 
formulations,  [11]  deals  with  a  version  of  visibility  limited  to 
an  angle,  instead  of  the  entire  region.  [12]  considers  a  suc¬ 
cessive  pursuit  of  multiple  evaders  by  a  single  faster  pursuer 
in  the  plane  with  sensing  range  limited  to  a  finite  disc.  [13] 
proposes  a  multi-phase  pursuit  strategy  for  groups  of  pursuers 
with  limited  range  sensing  and  have  demonstrated  its  capture 
properties  in  bounded  environments  via  simulation  without 
formal  proof.  In  discrete  time  formulations,  [14]  considers  the 
problem  on  a  graph,  with  the  visibility  of  the  pursuer  limited  to 
nodes  adjacent  to  the  current  node  of  a  pursuer.  A  framework 
which  uses  probabilistic  models  for  sensing  devices  for  the 
agents  is  described  in  [15]  and  [16]. 

We  address  the  case  of  limited  range  sensing  capability: 
a  pursuer  and  an  evader  can  sense  each  other  only  if  the 
distance  between  them  is  less  than  or  equal  to  a  given  sensing 
radius.  We  consider  the  discrete-time  version  with  one  or  many 
pursuers  and  a  single  evader  in  a  planar  environment.  The 
motion  of  each  player  is  constrained  to  a  stepping  disc  around 
it.  The  evader  is  initially  located  inside  a  bounded  subset  of 


the  environment,  which  we  term  as  the  field.  The  players  can 
leave  the  field  but  not  the  environment.  The  evader  follows 
a  reactive  rabbit  model,  i.e.,  does  not  move  until  it  senses  a 
pursuer  [14].  We  present  an  algorithmic  approach  in  the  form 
of  a  Sweep-Pursuit-Capture  strategy  for  the  pursuer  to  capture 
the  evader.  We  demonstrate  this  strategy  using  two  variants  of 
the  pursuit-evasion  game:  the  first  involves  a  single  pursuer 
and  the  evader  in  a  bounded  convex  environment  while  the 
second  considers  multiple  cooperating  pursuers  to  capture  the 
evader  in  a  boundaryless  environment. 

In  the  first  game,  the  pursuer  sweeps  the  environment  in  a 
definite  path  until  the  evader  is  sensed,  which  must  necessarily 
happen  in  finite  time.  This  is  analogous  to  the  spanning-tree 
based  coverage  presented  in  [17].  We  then  establish  how  a 
Greedy  strategy  of  moving  towards  the  last-sensed  location 
of  the  evader,  eventually  reduces  the  present  problem  to  a 
previously-studied  one  with  unlimited  sensing.  The  convexity 
assumption  on  the  environment  is  required  because  otherwise, 
due  to  the  limited  sensing  range,  there  exist  environments 
similar  to  those  considered  in  [10]  and  an  evader  strategy, 
such  that  the  evader  does  not  get  detected  again.  Finally,  we 
show  how  capture  is  achieved  using  the  established  Lion 
strategy  [4].  Our  contributions  are  as  follows:  First,  we  present 
an  analysis  which  provides  a  novel  upper  bound  on  the  time 
required  for  the  pursuit  phase  to  terminate.  This  bound  is  an 
improvement  compared  to  our  earlier  work  [18].  Second,  we 
obtain  a  sufficient  condition  on  the  ratio  of  sensing  to  stepping 
radius  of  the  players  for  capture  to  take  place  in  a  given 
environment.  Finally,  we  show  that  this  condition  is  tight  in 
the  sense  that  if  it  is  violated,  then  there  exist  sufficiently  large 
environments,  an  evader  strategy  and  initial  positions  for  the 
players,  that  lead  to  evasion  against  the  Greedy  strategy. 

The  second  game  is  played  with  at  least  five  cooperative  pur¬ 
suers  in  a  boundaryless  environment  and  the  field  is  a  bounded 
region  known  to  the  pursuers.  Our  contributions  are  as  follows: 
First,  we  design  a  novel  pursuer  formation  and  a  randomized 
Sweep  strategy  for  the  pursuers  to  search  the  field.  They 
succeed  when  they  detect  the  evader  inside  a  special  capture 
region,  which  we  characterize  for  the  pursuer  formation.  We 
show  that  using  our  Sweep  strategy,  the  pursuers  succeed 
with  a  certain  probability  which  is  a  function  of  the  pursuer 
formation  and  independent  of  the  initial  evader  location.  Next, 
we  propose  a  cooperative  pursuit  strategy  for  the  pursuers  to 
confine  the  evader  within  their  sensing  discs.  We  show  that 
using  this  pursuit  strategy,  the  present  problem  is  reduced  to 
a  previously-studied  one  with  unlimited  sensing.  Finally,  we 
show  how  capture  is  achieved  using  the  established  PLANES 
strategy  [5],  We  obtain  novel  upper  bounds  on  the  time  for 
each  phase  in  our  strategy,  which  is  an  improvement  compared 
to  our  earlier  work  [19].  Also,  we  present  a  simulation-based 
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study  of  the  performance  of  the  strategies  under  sensing  errors, 
different  ratios  of  sensing  to  stepping  radius,  greater  evader 
speed  and  different  number  of  pursuers. 

The  inspiration  for  the  cooperative  strategy  proposed  in 
this  paper  has  been  derived  from  aspects  of  animal  behavior. 
It  is  well  known  that  predators  hunt  as  a  conjoined  group, 
when  it  is  less  efficient  to  hunt  alone.  This  behavior  is  also 
observed  when  the  prey  is  large  or  can  move  as  fast  as  the 
predators  [20].  Further,  predators  show  an  inclination  towards 
specialized  behavior  by  maintaining  a  fixed  formation  during 
search  and  capture  of  preys  [21].  Such  specializations  suggest 
that  there  may  be  configurations  that  are  preferred  during 
group  hunting.  Also,  in  the  presence  of  sensing  limitations, 
groups  tend  to  maintain  spacing  between  each  other  that  is 
regulated  by  their  sensory  capabilities  [22].  These  facts  give  us 
additional  hints  towards  designing  capture-conducive  predator 
formations.  In  this  context,  our  analysis  sheds  light  on  how 
the  maximum  group  size  of  the  predators  varies  with  prey 
availability  and  with  the  prey’s  nutrition  value  in  the  present 
set-up. 

The  paper  is  organized  as  follows:  the  problem’s  mathe¬ 
matical  model  and  assumptions  are  presented  in  Section  II. 
The  individual  phases  of  the  sweep-pursuit-capture  strategies 
and  the  corresponding  main  results  for  both  the  problems 
are  presented  in  Section  III.  The  proofs  of  the  results  are 
presented  in  Section  IV.  Simulation  results  are  presented  in 
Section  V.  Finally,  in  Section  VI,  we  study  the  relationship 
between  pursuer  group  sizes  and  evader  availability  and  its 
nutrition  value  in  our  set-up. 

II.  Problem  Set-up 

We  assume  a  discrete-time  model  with  alternate  motion 
of  the  evader  and  the  pursuers:  the  evader  moving  first.  We 
assume  that  the  players  can  sense  each  other  precisely  only  if 
the  distance  between  them  is  less  than  or  equal  to  the  sensing 
radius  rsens-  Further,  we  assume  that  at  each  time  instant,  the 
players  take  measurements  of  each  other  before  and  after  the 
evader’s  move,  as  shown  in  Figure  1.  Define  Q,t,  :=  QU  (f>, 
where  Q  C  R2  denotes  the  environment  and  0  is  the  null 
element.  The  null  element  will  be  used  to  denote  a  lack  of 
measurement  in  our  limited  range  sensing  model.  Let  Q  C  Q 
denote  the  field,  i.e.,  the  region  that  initially  contains  the 
evader.  The  evader  follows  a  reactive  rabbit  model  -  moves 
only  after  being  detected  for  the  first  time.  We  assume  that  the 
pursuers  know  the  field  Q  and  the  environment  Q.  The  goal  of 
the  pursuer(s)  is  to  capture  the  evader,  i.e.,  a  pursuer  and  the 
evader  are  at  the  same  position  at  some  finite  time.  Evasion 
is  said  to  occur  if  the  pursuer  cannot  capture  the  evader.  We 
describe  the  Sweep-Pursuit-Capture  strategy  for  the  following 
problems: 


A.  Single  pursuer  problem 

We  have  a  bounded  convex  environment  Q  C  R2  and  the 
field  Q  =  Q.  Let  e[t]  and  p[t]  denote  the  absolute  positions  of 
the  evader  and  the  pursuer  respectively,  at  time  t  £  Z>q.  The 


Evader  moves  to  e[r]  Pursuer  moves  to  p[r] 


Fig.  1.  A  snapshot  of  each  time  instant  r  E  {1,2,...}  in  our  alternate 
motion  model.  The  players  take  measurements  before  and  after  the  evader’s 
move. 


discrete-time  equations  of  motion  are 

e[t\  =  e[t  -  1]  +  ue(e[t  -  1],  {j4f[r]}*=1,  , 

p[t]  =  p[t  -  1]  +  UP(p[t  -  1],  {t/befMlUu  h&MlU) > 

(1) 


where  at  the  rth  time  instant,  t/beflT]>  2/aft  M  e  Q<£  are 
the  measurements  of  the  pursuer’s  position  taken  by  the 
evader  before  and  after  the  evader’s  move,  as  shown  in 
Figure  1.  The  parentheses  notation  { J/bef [T1  }t-=i  denotes  the 
set  { 2/bef [ 1  ] >  ^bef [2] >  •  ■  ■  ,2/befWI-  Due  to  limited  range  sensing 
model,  for  r  £  {1, . . . ,  f},  we  define 


2/befM 


P[r  -  !]> 

0, 


if  || p[r  -  1]  -  e[r  -  1]||  <  rsens 
otherwise. 


(2) 


For  notational  convenience,  we  define  { 2/aft  M}t==i  =  <j>  for 
the  initial  time  t  =  1.  For  t  >  2  and  for  r  £  {1,  1}, 

we  have 


2/aft  M 


P[T  -  1], 

0, 


if  ||p[r  —  1]  —  e[r]||  <  r, 

otherwise. 


(3) 


Similarly,  at  the  rih  time  instant  ,  y^ef[r],  t/^Jr]  G  Q $ 
are  the  measurements  of  the  evader’s  position  taken  by  the 
pursuer  before  and  after  the  evader’s  move  respectively,  as 
shown  in  Figure  1.  Due  to  limited  range  sensing  model,  for 
r  £  we  have 


i&fM 


e[r-  i], 

4> , 


if  ||e[r  -  1]  -p[r  -  1]||  <  rsens 
otherwise. 


(4) 


For  t  G  {1,  ...,£},  we  have 


2/aft  M 


e[r],  if  ||e[r]  -  p[r  -  1]||  <  rsens 

</>,  otherwise. 


(5) 


The  functions  ue  :  Q  x  x  •  •  •  x  — >  Q  and  up  : 

Si  v  ✓ 

2t  —  1  times 

Q  x  0.0  x  •  •  •  x  — *  Q  are  termed  as  strategies  for  the 

- v - ' 

2 1  times 

evader  and  pursuer  respectively.  The  apparent  lack  of  symme¬ 
try  between  the  number  of  arguments  in  the  strategies  of  the 
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evader  and  the  pursuer  is  due  to  the  alternate  motion  model. 
We  assume  that  both  players  can  move  with  a  maximum  step 
size  of  rstep,  that  is, 

||«e||  <  T-step,  lluPH  ^  rstep-  (6) 

The  sensing  radius,  rsens,  is  k  times  the  motion  radius,  rstep.  We 
assume  k  is  greater  than  1,  i.e.,  both  players  can  sense  further 
than  they  can  move.  From  the  reactive  rabbit  model  for  the 
evader,  we  have  ue  =  0  until  the  evader  is  detected.  After  this 
happens,  the  single  pursuer  problem  consists  of  determining 
up  that  guarantees  capture  for  any  evader  strategy,  ue.  This 
problem  is  described  by  two  key  parameters:  the  ratio  of 
sensing  to  stepping  radius  k  and  the  ratio  of  the  diameter  of 
the  environment  to  the  stepping  radius  dlam(S) . 


B.  Multiple  pursuer  problem 

We  have  a  total  of  TV  >  5  pursuers  that  can  communicate 
among  themselves  the  location  of  a  sensed  evader  as  well  as 
their  own  position  with  respect  to  a  fixed,  global  reference 
frame.  The  environment  Q  is  M2  and  the  field  Q  is  a  bounded 
subset  of  R2.  Define  R^  :=  R2  U  </>.  Let  Pj[t)  denote  the 
absolute  positions  of  the  jth  pursuer  at  time  t  for  every  j  £ 
{1, . . . ,  TV}.  Analogous  to  (1),  the  discrete-time  equations  of 
motion  are 

e[t]  =  e[t  -  1]  +  ue(e[t  -  1  ],  {y^f[r]}‘=1,  , 

Pj[t]  =PAt~  !] 

+  uw({{PjM}jL1}T=i,  {2/befM}t=i>  {2/StM}t=i)’ 

(7) 

where  at  the  rth  time  instant,  t/bef[rL  VtfAT\  G  R|  x  •  •  •  x  R^, 

S.  v  ✓ 

N  times 

denote  the  sets  of  measurements  of  the  pursuers’  positions 
taken  by  the  evader  before  and  after  its  move.  Similarly, 
^betHi  J/aftM  £  R2  are  the  measurements  of  the  evader’s 
position  taken  by  the  pursuers  before  and  after  the  evader’s 
move.  The  measurements  are  given  by  expressions  analogous 
to  (2)-(5).  Akin  to  the  single  pursuer  problem,  the  functions 
ue  :  R2  x  R|  x  •  •  •  x  R2  — >  R2  and  uPj  :Rjx--xRjx 

(2 1  —  1)N  times  ( t  —  1)N  times 

R^  x  •  •  •  x  R^  — >  R2  for  every  j  £  {1, . . . ,  TV},  are  strategies 

Si  v 

2 1  times 

for  the  evader  and  pursuers  respectively.  The  constraint  on  the 
maximum  step  size,  given  by  (6),  holds  for  the  evader  and 
every  pursuer.  Due  to  the  reactive  rabbit  model  for  the  evader, 
ue  —  0  until  it  is  detected  by  the  pursuers  for  the  first  time. 

The  multiple  pursuer  problem  consists  of  designing  a  pur¬ 
suer  formation  and  a  corresponding  strategy  that  guarantees 
capture  of  the  evader.  This  problem  is  described  by  the 
following  key  parameters:  the  ratio  of  sensing  to  stepping 
radius  of  the  players  the  ratio  of  the  diameter  of  the  field 
to  the  stepping  radius  dlam(g) ;  and  the  number  of  pursuers  TV. 


III.  The  Sweep-Pursuit-Capture  Strategies  and 
Main  Results 

In  this  section,  we  describe  the  Sweep-Pursuit-Capture 
strategies  for  both  the  problems  and  the  corresponding  main 
results.  The  proofs  are  presented  in  Section  IV. 

We  first  introduce  the  following  weak  notion  of  capture. 


Definition  III.l  (Trap)  The  evader  is  trapped  within  the 
sensing  radius  (resp.  radii)  of  the  pursuer  (resp.  pursuers)  if 
for  any  evader  strategy  ue,  the  motion  disc  of  the  evader  is 
completely  contained  within  the  sensing  disc  of  the  pursuer 
(resp.  union  of  the  sensing  discs  of  the  pursuers)  after  a  finite 
time. 

To  be  specific,  the  evader  is  trapped  at  time  instant  7;rap  if 
for  any  evader  strategy, 

ybefptrap]  =  e[Ttrap  —  1],  and  2/aftPtrap]  =  eptrap]. 

The  idea  behind  our  Sweep-Pursuit-Capture  strategies  is  to 
detect  the  evader  and  pursue  it  so  as  to  trap  it.  Next,  we  show 
that  the  evader  remains  trapped  for  all  subsequent  time  instants 
and  that  the  pursuers  achieve  capture  by  using  strategies  that 
were  developed  for  the  unlimited  range  sensing  version  of  the 
game.  This  principle  applies  to  both  versions  of  the  problem. 


A.  Single  pursuer  problem 

We  first  present  each  phase  of  the  strategy  for  the  single 
pursuer  problem. 

1 )  Sweep  phase  -  Sweep  strategy:  Let  diam(Q)  denote  the 
diameter  of  Q.  The  Sweep  strategy  for  the  pursuer  is  to  move 
with  maximum  step  size  along  a  path,  as  shown  in  Figure  2 
such  that  the  union  of  the  sensing  discs  of  the  pursuer  at  the 
end  of  each  step  until  the  end  of  this  phase  contains  Q.  We 
term  such  a  path  a  sweeping  path  for  Q.  Let  fsweep  denote  the 
time  taken  for  this  strategy  to  terminate.  We  have  the  following 
result. 


Lemma  III.2  (SWEEP  strategy)  In  the  single  pursuer  prob¬ 


lem  with  parameters  n  and 


diam(Q) 


the  time  isweep  taken  by 


the  Sweep  is  at  most 


diam(g) 

‘2  k  rst(.n 


diam(g) 

rstep 


r«l) 


steps. 


Fig.  2.  A  sweeping  path  to  detect  the  evader  in  the  Single  pursuer  problem 
using  the  SWEEP  strategy. 
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2)  Pursuit  phase  -  Greedy  strategy:  Once  the  evader  is 
detected,  the  Greedy  strategy  for  the  pursuer  is  to  move 
towards  the  last  sensed  position  of  the  evader  with  maximum 
step  size.  This  strategy  has  the  property  that  the  pursuer  senses 
the  evader’s  position  at  every  successive  time  instant.  Let  ftrap 
denote  the  trapping  time,  i.e.,  the  time  taken  by  the  pursuer 
to  trap  the  evader  after  detecting  it.  We  now  present  our  main 
result  for  the  Greedy  strategy. 


Theorem  III.3  (Greedy  strategy)  In  the  single  pursuer 
problem  with  parameters  k  and  dlam(S),  if  n  >  \J2  +  2  cos  /3C, 
where 


Pc~ 


v/3  diam(Q) 
4  K  2fiXstep 


arctan  ■ 


8k 


(8) 


then  the  GREEDY  strategy  has  the  following  properties: 


(i)  the  pursuer  traps  the  evader  within  its  sensing  radius, 
and 

(ii)  the  trapping  time  ftrap  satisfies 


/ 

log 

Y  k,2  —  sin2  (3C— cos  /3C  —  1  \ 

\ 

l  -1  J 

+  1 

J 

V 

log(l  - 

diam(Q) 

2  k  T  step 

(9) 


Furthermore,  if  k  >  2,  then  as  (diam(Q)/rstep)  — >  +oo, 
^trap  S  O  ((diam(Q)/rstep)3). 


Theorem  III. 3  is  tight  in  the  sense  that  if  the  condition  on 
k  is  violated  then  there  exist  sufficiently  large  environments, 
an  evader  strategy  and  initial  positions  for  the  players,  that 
lead  to  evasion  against  the  Greedy  pursuer  strategy.  This  is 
described  by  the  following  result. 


Proposition  III.4  (Evasion)  Given  a  single  pursuer  problem 
with  parameters  k  and  such  that  k  <  \J2  +  2  cos  f3c, 

where  /3C  is  given  by  (8),  and  Q  contains  a  circle  of  radius 
-d=p=2,  then  there  exist  an  evasion  strategy  and  initial  posi¬ 
tions  of  the  players  for  which  the  pursuer’s  GREEDY  strategy 
fails  to  trap  the  evader. 

Figure  3  illustrates  this  evasion  strategy  under  the  conditions 
required  by  Proposition  III.4. 


Fig.  3.  Illustrating  evasion.  The  dotted  circles  are  the  player’s  motion  discs 
and  the  solid  circle  is  the  pursuer’s  sensing  disc.  e[t]  and  p[t]  are  on  the 
circle  fl  described  in  Proposition  III. 4  such  that  |  e  [ /.]  —  p  [t]  1 1  =  rstep.  Evader 
chooses  to  move  to  e[t  +  1]  on  S7  with  full  step  size. 


3)  Capture  phase  -  Lion  strategy:  Once  the  evader  is 
trapped  within  the  sensing  range  of  the  pursuer,  the  pursuer 
employs  the  Lion  strategy  from  [4]  to  complete  the  capture. 
For  the  sake  of  completeness,  we  now  give  a  brief  description 
of  the  Lion  strategy,  adapted  to  the  present  problem  setting. 

The  Lion  strategy  can  be  applied  to  this  phase  as  follows: 

(i)  Prior  to  its  (f  +  l)th  move,  the  pursuer  constructs  the 
line  e[f]p[£],  as  shown  in  Figure  4.  Let  this  line  intersect 
the  boundary  of  the  environment  at  a  point  X[t\  such 
that  p[t\  lies  between  e[t]  and  X [£]. 

(ii)  The  pursuer  then  also  constructs  the  line  e[f  +  1]A'[£] 
and  moves  to  the  intersection  of  this  line  with  the  circle 
centered  at  p[t]  and  of  radius  rstep.  Of  the  two  possible 
intersection  points,  the  pursuer  selects  the  one  closer  to 
e[f  +  1]  ■ 


Fig.  4.  Single  pursuer  problem:  Using  the  LION  strategy  to  capture  the 
evader.  The  dotted  circles  represent  the  motion  discs  of  the  players. 


This  construction  guarantees  that  the  intersection  point  X\t\ 
remains  the  same  as  the  point  X[fsweep  +  ftrap]-  for  every  t  > 
fsweep  +  ftrap,  where  fsweep  +  ftrap  is  the  time  at  the  end  of  the 
pursuit  phase.  Denoting  by  £cap  the  time  taken  by  the  pursuer 
to  capture  the  evader  after  trapping  it,  we  have  the  following 
result. 


Theorem  III.5  (LlON  strategy  [4])  In  the  single  pursuer 
problem  with  parameters  k  and  dlam(Q) ,  after  trapping  the 
evader  within  the  sensing  radius  and  using  the  Lion  strategy. 


(i)  the  distance,  || p[t]  —  e\ 
of  time, 

(ii)  the  pursuer  captures  the  evader, 

(  diam(C)  \  2 


is  a  non-increasing  function 


(iii)  tcap  is  at  most 


steps. 


Thus,  our  problem  with  limited  sensing  is  solved  because 
once  the  evader  is  trapped  within  the  pursuer’s  sensing  radius, 
it  remains  trapped  until  capture,  from  part  (i)  of  Theorem  III. 5. 
We  have  also  obtained  an  upper  bound  on  the  total  time  to 

CaptUie,  i.e.,  fsweep  *F  ftrap  T  fcap- 


B.  Multiple  pursuer  problem 

This  section  describes  the  sweep-pursuit-capture  strategy  for 
multiple  pursuers  and  the  corresponding  results.  We  assume 
that  k  >  4  and  N  >  5.  We  define  the  following  formation  for 
multiple  pursuers. 

Definition  III.6  (Trapping  chain)  A  group  of  N  >  5  pur¬ 
suers  {pi, . . .  ,pn}  are  said  to  be  in  a  trapping  chain  forma¬ 
tion  if 
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(i)  P2,  ■  ■  ■  ,Pn-i  placed  counterclockwise  on  a  semi¬ 
circle  with  diameter  equal  to  \\p2  —  Pn- i||, 

(ii)  for  all  j  €  {1, ...  ,N  —  1} 

I \Pj  -  Pj+ ill  =  rs tep^K2  -  25, 

and 

(iii)  Pi,P2,Pn-i,Pn  are  on  the  vertices  of  a  rectangle  such 
that  the  polygon  with  vertices  {pi, . . .  ,p_/v},  in  that 
order,  is  convex  (cf  Figure  5). 


Fig.  5.  A  trapping  chain  formation  for  N  =  9  pursuers.  The  circles  around 
the  pursuers  denote  their  sensing  ranges.  The  lightly  shaded  region  denotes 
the  capture  region  and  the  darkly  shaded  region  along  with  the  lightly  shaded 
one  denotes  the  extended  capture  region. 


(ii)  Form  a  sweeping  path  for  the  square  region  and  sweep 
along  adjacent  strips  as  shown  in  Figure  6. 

The  shaded  region  in  Figure  6  refers  to  the  area  that  would  fall 
in  the  proposed  capture  region  S.  Now  we  are  interested  in 
determining  the  probability  that  an  evader  falls  in  the  shaded 
region  in  Figure  6.  That  is  given  by  the  following  result. 

diam(C/) 


Fig.  6.  Multiple  pursuer  problem:  SWEEP  strategy.  The  shaded  region 
represents  the  region  swept  by  the  capture  region  of  the  trapping  chain. 


We  now  describe  the  Sweep-Pursuit-Capture  strategy  for  the 
multiple  pursuer  problem. 

1 )  Sweep  phase  -  Sweep  strategy:  The  pursuers  begin  by 
placing  themselves  in  a  trapping  chain  formation.  We  define 
the  capture  region  S  for  a  trapping  chain  by 

S=  |J  BPj(rsens)  n  Co{p2,  •  •  ■  ,Pn-i}, 

je{3,...,N-2} 

where  BPj  (rsens)  C  M2  denotes  the  sensing  disc  of  pursuer 
Pj  and  Co{p2)  •  •  •  iPiv-i}  C  R2  denotes  the  interior  of  the 
convex  hull  of  {p2,  ■  ■  ■  ,Pn-i}-  The  lightly  shaded  region  in 
Figure  5  is  the  proposed  capture  region,  S,  for  the  trapping 
chain.  In  the  sweep  phase,  pursuers  wish  to  detect  the  evader 
within  the  capture  region.  We  consider  a  square  region  of 
length  equal  to  diameter  of  the  region  Q,  diain(C)  that  contains 
the  field  Q.  The  pursuers  sweep  this  square  region  in  the 
direction  of  the  normal  to  piPn,  outward  with  respect  to  the 
convex  hull  of  the  pursuers.  For  a  trapping  chain  shown  in 
Figure  5,  we  define  the  effective  length  l  as 

(x/4ac2  —  25  \ 

— — - — -  -  2k  .  (10) 

sin(2?W33y)  / 

As  the  pursuers  move  in  the  direction  described  earlier,  they 
clear  a  rectangular  strip  of  length  diam(C7)  and  width  (+4rsens- 
The  Sweep  strategy  for  the  pursuers  is  as  follows. 

(i)  Choose  the  first  rectangular  strip  at  a  random  distance 
/()  from  one  edge  of  the  square  region  containing  Q 
and  sweep  it  length-wise.  The  distance  Iq  is  a  uni¬ 
form  random  variable  taking  values  in  the  interval 
[— 2rsens,  l  +  2rsens].  Here,  negative  Iq  implies  that  some 
of  the  pursuers  may  begin  sweeping  from  outside  the 
region  Q. 


Theorem  III.7  (SWEEP  strategy)  In  the  multiple  pursuer 
problem  with  parameters  n,  dl^m^)  an(j  jy,  yor  any  projj_ 
ability  distribution  for  the  initial  position  of  the  evader  with 
support  on  Q,  using  the  SWEEP  strategy, 

(i)  the  probability  P  of  detecting  the  evader  inside  S  for  a 
group  of  pursuers  in  a  trapping  chain,  satisfies 

„  l  ,  27 TK 

P  >  -  >1 - =^= - , 

~  l  +  4rsens  “  (V4 k2  -  25 (N  -  3)  +  2t tk) 

and 

(ii)  the  time  tswee p  taken  by  the  SWEEP  strategy  to  terminate 
satisfies 


Cwccp  ^  ' 


-diam(t/)  / 

7t/2  N-j 

Atep  ' 

\J  4k2  —  25(N  —  3)  +  7 tk  ' 

X 

+2Vl^T~SN 

T’step 

Remark  III.8  The  minimum  probability  P  of  the  pursuers 
detecting  the  evader  inside  the  capture  region  by  using  the 
SWEEP  strategy  is  independent  of  the  evader’s  location  in  Q. 
This  means  that  the  best  that  the  evader  can  do  in  the  present 
framework  is  to  locate  itself  initially  with  a  uniform  probability 
in  Q. 


We  shall  see  that  the  pursuers  win  when  the  evader  is 
detected  in  S  by  the  pursuers.  Otherwise,  there  exists  a  path  for 
the  evader  such  that  it  can  avoid  being  captured  indefinitely. 

2)  Pursuit  phase  -  ClRCUMCENTER  strategy:  If  the  evader 
is  detected  within  the  proposed  capture  region  at  time  fswee P, 
the  pursuers  need  to  ensure  that  they  trap  the  evader  within 
their  sensing  ranges.  Before  we  describe  the  strategy  for  the 
pursuit  phase,  consider  the  following  possibility:  if  the  evader 
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steps  into  the  darkly  shaded  region  of  the  sensing  range  of 
P2  (or  of  pn~ i),  then  p2  (resp.  pn-i)  can  use  the  Greedy 
strategy  (ref.  Section  III-A2).  By  moving  towards  the  evader, 
the  evader’s  motion  disc  gets  contained  inside  that  pursuer’s 
sensing  disc  and  thus  the  evader  gets  trapped.  This  motivates 
us  to  define  an  extended  capture  region  S'  for  the  trapping 
chain  by 

Se  =  (J  BPj(rsens)  G  Co{p2,  ■  •  ■  ,Pn-i}- 

The  darkly  shaded  region  along  with  the  lightly  shaded  region 
in  Figure  5  is  the  extended  capture  region  Se  for  the  trapping 
chain. 

We  now  present  the  following  pursuit  strategy.  At  each  time 
Step  t  ^2  fsweepj 

(i)  While  e[t  +  1]  ^  <Se [t] ,  the  pursuers  P2,  ■  ■  ■  ,Pn- i  move 
towards  the  circumcenter 1  O  of  the  triangle  formed  by 
P2 [f sweep] 5  e[fsweep]  and  pN_ 1 [tsweep]  with  maximum  step. 
Pursuers  pi  and  p\-  move  parallel  to  p2  and  pn-i 
respectively. 

(ii)  Otherwise,  one  of  the  pursuers  which  senses  the  evader 
directly,  makes  a  Greedy  move  (ref.  Section  III-A2) 
towards  the  evader  and  the  others  move  parallel  to  that 
pursuer  with  the  maximum  step. 

One  such  move  is  shown  in  Figure  7.  In  case  (i)  of  the 
strategy,  note  that  the  pursuers  may  not  sense  the  evader  in 
every  subsequent  move.  But  they  will  encircle  the  evader  by 
“closing”  the  trapping  chain  around  it  and  then  shrink  the 
enclosed  region  around  the  evader.  We  thus  have  the  following 
result. 


Fig.  7.  Multiple  pursuer  problem:  CIRCUMCENTER  strategy.  At  time  £Sweep, 
the  evader  position  is  sensed  by  p^.  Pursuers  P2,  •  •  •  ,P8  move  towards  O,  the 
circumcenter  of  triangle  formed  by  p2,  e  and  ps-  pi  and  pg  move  parallel  to 
P2  and  ps  respectively.  The  circles  around  the  pursuers  represent  their  sensing 
discs. 


Theorem  III.9  (CIRCUMCENTER  strategy)  In  the  multiple 
pursuer  problem  with  parameters  n,  dlam(g)  and  j\r,  starting 
from  a  trapping  chain  formation,  if  the  evader  is  detected  with 
c [f sweep]  G  5[tSweep],  then  using  the  CIRCUMCENTER  strategy, 
(i)  the  pursuers  trap  the  evader  within  their  sensing  radii, 

Die  circumcenter  of  a  triangle  is  the  unique  point  in  the  plane  which  is 
equidistant  from  all  of  its  three  vertices. 


(ii)  the  trapping  time  itrap  satisfies 

ttrap  <  a/ 4^C2  -  2§n(i  +  -  1  - )  , 

V  2  sin  0/ 

where 

6(k)  = - arctan  (  -  .  : 

4  W3/c2  -25/ 

and 

(iii)  at  that  time,  the  evader  is  inside  the  pursuers’  convex 

hull  in  such  a  way  that 

B  req>  {(’.  sweep  +  ftrap])  C  Co{pi,  .  .  .  ,PN}[t  sweep  T  ftrap]  ■ 

2  (ID 

The  Circumcenter  strategy  guarantees  trapping  of  the 
evader  even  without  pursuers  p\  and  p.y .  But  in  that  case,  the 
inclusion  in  (11),  which  will  be  required  to  establish  an  upper 
bound  on  the  time  for  the  capture  phase  that  follows,  is  not 
guaranteed. 

3 )  The  Capture  phase  -  Planes  strategy:  Once  the  evader 
is  trapped  within  the  sensing  ranges  of  the  pursuers,  the 
pursuers  use  the  PLANES  strategy  from  [5]  to  capture  the 
evader.  Before  stating  our  results,  we  reproduce  this  strategy 
for  completeness. 

Let  the  time  at  the  end  of  the  pursuit  phase  be  £sweep  +  ftrap 
and  the  evader  be  inside  the  convex  hull  of  the  pursuers  as 
in  (11)  (cf.  Figure  8).  For  t  >  fsweep  +  ttI ap  and  for  every 
pursuer  pj : 

•  Draw  the  line  hj[t  +  1]  through  e[t  +  1],  parallel  to  the 
line  joining  e[t]  and  Pj[t\,  as  shown  in  Figure  9. 

•  Move  to  the  point  closest  to  e[t  +  1]  on  the  line  hj  [t  +  1] 
with  maximum  step  size. 


Fig.  8.  Multiple  pursuer  problem:  evader  trapped  inside  pursuers’  convex 
hull. 


Fig.  9.  Multiple  pursuer  problem:  PLANES  strategy.  Draw  the  line  hj  [t  +  1] 
through  e[t+l],  parallel  to  the  line  segment  e[t]py[t]  and  move  onto  it  closest 
to  the  evader. 

Theorem  III. 9  shows  that  use  of  the  CIRCUMCENTER  strat¬ 
egy  in  the  pursuit  phase  leads  to  the  evader  being  trapped  and 
inside  the  convex  hull  of  the  pursuers.  Now  capture  follows 
from  the  following  theorem,  which  was  partly  inspired  by  the 
results  on  the  PLANES  strategy  in  [5], 

Theorem  III.10  (PLANES  strategy)  In  the  multiple  pursuer 
problem  with  parameters  n,  dlam(g)  ancL  jV,  let  the  evader 
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be  trapped  inside  the  convex  hull  of  the  pursuers  such  that 
property  (11)  is  satisfied.  If  every  pursuer  follows  the  PLANES 
strategy,  then 

(i)  the  distances,  ||pj[t]  —  e\t\  ||  for  every  j  £  {1, . . . ,  N}, 
are  non-increasing  functions  of  time, 

(ii)  the  pursuers  capture  the  evader  and 

(iii)  the  time  tc ap  taken  in  the  capture  phase  is  at  most 
18k V4k2  -257V. 

Item  (i)  of  Theorem  III.  10  implies  that  once  the  evader  is 
trapped  within  the  sensing  ranges  of  the  pursuers,  it  remains 
trapped  within  their  sensing  ranges  at  the  end  of  every  pursuer 
move.  The  capture  is  now  complete  and  we  obtained  a  novel 
upper  bound  on  the  total  time  to  capture,  i.e.,  4weep+itrap+icap- 

IV.  Proofs  of  the  Results 

In  this  section,  we  formally  prove  the  main  results. 


A.  Single  pursuer  problem 


Proof  of  Lemma  III.  2:  To  determine  an  upper  bound  for  tsweep, 
consider  placing  Q  inside  a  square  region  of  length  diam(  Q) 
and  the  pursuer  moving  along  a  hypothetical  sweeping  path  for 
the  square  region,  as  shown  in  Figure  10.  It  is  straightforward 
to  show  that  to  achieve  coverage,  this  hypothetical  sweeping 
path  is  between  strips  of  width  2 Krstep,  parallel  to  the  side. 


selection  of  width  for  the  There  are 


diam(C) 

2  K'f  i,|ep 


such  strips 


and  it  takes  at  most 


diam(g) 


+  |~k~|  steps  to  sweep  one  strip 


completely  and  be  positioned  to  sweep  through  a  neighboring 
strip  of  this  hypothetical  sweeping  path.  ■ 


Fig.  10.  A  hypothetical  sweeping  path  to  determine  upper  bound  on  number 
of  steps  to  detect  the  evader. 

To  prove  Theorem  III. 3,  we  need  some  preliminary  defi¬ 
nitions  and  results  which  we  present  now.  In  what  follows, 
the  notation  ZABC  refers  to  the  smaller  of  the  two  angles 
between  segments  AB  and  BC. 

Definition  IV.l  (Deviation  and  evasion  angles)  Given 
evader  and  pursuer  at  positions  e[r],  p[r],  for  r  £  {t,t  +  1}, 
define  the  deviation  angle  a[t\  and  the  evasion  angle  /3[t\  by: 

a[t ]  :=  Ze[t  +  1  }p[t  +  l]e[i], 
f3[t\  :=  a[t]  +  Zp[t  +  l]e[t  +  l]e[t]. 

These  angles  are  illustrated  in  Figure  1 1 .  The  following  result 
follows  trivially. 


Proposition  IV.2  When  the  pursuer  uses  the  Greedy  strat¬ 
egy,  for  every  instant  of  time  t, 

\m\>wt]\-  (i2) 

Note  that  equality  in  (12)  only  holds  when  the  evader  moves 
away  from  the  pursuer  along  the  line  p[f]e[t]. 


Fig.  11.  Relation  between  the  deviation  angle  a[t\  and  the  evasion  angle 
0[t],  The  dotted  circles  represent  the  motion  discs  of  the  players.  The  circle 
centered  at  p[t]  (shown  partially  here)  is  the  pursuer’s  sensing  range. 

It  can  be  deduced  that  when  the  pursuer  employs  the 
Greedy  strategy,  the  distance  between  the  pursuer  and  evader 
is  a  non-increasing  function  of  time.  We  now  define  a  geomet¬ 
ric  construction  which  is  useful  in  the  proof. 

Definition  IV.3  (Cone  sector  sequence)  Let  to  denote  the 
time  at  the  end  of  the  sweep  phase.  Given  a  time  instant 
k  £  Z>o,  the  sequence  Ck  of  cone  sectors  Ck,i  for  i  £  Z>o  is 
defined  as  follows: 

(i)  Define  the  cone  sector  Ck, o  with  p[tv]  as  its  vertex,  angle 

bisector  defined  by  the  segment  e[tk]p[fk]  ond  extended  to 
a  point  X  beyond  e\tf\  such  that  Lc one  :=  ||p[tk]  —  ||  = 

2Krstep,  as  shown  in  Figure  12.  Let  the  segment  Y Z  be  of 
length  and  perpendicular  to  the  segment  pf^]  X  with 
X  as  its  midpoint.  Accordingly,  let  6  :=  ZYp[tf\Z  = 
2  arctan(l/8K)  be  the  cone  angle. 

(ii)  For  k,i  >  0,  denote  by  t*  the  time  when  the  evader 
leaves  the  cone  sector  Ck,%.  There  are  two  possibilities: 

(a)  the  pursuer  first  constructs  a  new  cone  sector  Ck,i+ 1 
which  is  a  translation  ofCky  having  vertex  at  p[t*].  This 
is  illustrated  in  Figure  13. 

(b)  If  the  evader  is  not  inside  Ck,i+i,  then  we  denote 
tk+i  '■=  t*.  The  pursuer  constructs  a  new  cone  sector 
sequence  Ck+ 1- 

The  cone  sector  sequence  described  above  has  the  following 
property. 


Proposition  IV.4  (Cone  sector  sequence)  Given  a  cone  sec¬ 
tor  sequence  Ck,  the  maximum  number  of  steps  N*  for  which 
the  evader  can  remain  inside  it  without  being  captured  satisfies 


4k  diam(Q) 
VS  2K7"step 


(13) 


Proof:  We  compute  an  upper  bound  on  the  number  of 
steps  a  pursuer  stays  in  any  cone  sector  while  using  the 


Fig.  12.  Construction  of  cone  Co,o-  Choose  X  on  the  line  e[to]p[to]  such 
that  ||p[i0]  —  .V |  =  2ftrstep.  YZ  has  length  and  is  perpendicular  to 
segment  p[to]X  with  X  as  its  midpoint.  6  is  the  cone  angle. 


D  Pi  Pi  C 


A 

Y 


Fig.  14.  Upper  bound  on  the  number  of  steps  a  pursuer  can  be  inside  a 
cone  sector.  The  cone  sector  Ck,o  is  illustrated  here.  The  dotted  path  shows 
a  hypothetical  pursuer  path  that  takes  the  maximum  number  of  steps  before 
leaving  a  cone  sector. 


Fig.  13.  Construction  of  cone  sector  Ck  i+ 1-  Translate  cone  sector  Ck  i  to 
have  its  vertex  at  p' . 


Greedy  strategy.  From  the  definition  of  a  cone  sector,  this  is 
also  an  upper  bound  on  the  number  of  steps  the  evader  can 
remain  inside  a  cone  sector.  Construct  a  rectangle  with  length 
LCone  and  width  such  that  it  contains  a  cone  sector,  as 
shown  in  Figure  14.  Orient  a  frame  of  reference  such  that  its 
X  axis  is  parallel  to  the  angle  bisector  of  the  cone  sector.  Let 

p[tk\—P\ - •  —  P5  denote  the  path  as  a  result  of  the  pursuer’s 

Greedy  strategy  while  in  the  cone  sector  Ck,o- 

We  now  construct  a  path  with  step  size  equal  to  rstep  at  each 
time  instant,  whose  length  is  greater  than  or  equal  to  that  of 
any  such  greedy  pursuer  paths.  Select  a  point  P[  between  A 
and  B  such  that  \\p[tk]  —  P{ ||  =  rstep.  Then  select  another 
point  P2  on  between  C  and  D  such  that  \\P[  —  P2\\  =  rste P. 
Of  the  two  possible  points,  select  that  point  which  is  farther 
from  p[tk]  as  P2.  Selecting  odd  P/’s  between  A  and  B  and 
even  P/’s  between  A  and  B  until  it  is  not  possible  to  select 
any  more  of  the  P/’s  on  segments  AB  and  CD.  This  is 
illustrated  in  Figure  14.  This  construction  leads  to  the  property 
that  the  X  coordinates  of  the  P/’s  are  smaller  than  those  of  the 
corresponding  Pf s.  Thus  the  number  of  P/’s  is  greater  than  or 
equal  to  the  number  of  Pf s.  Thus,  the  path  p[tk]  —  P[  —  ■  ■  •— P/ 
has  its  length  at  least  equal  to  that  of  p[tk\— Pi  —  ■  •  ■— P5.  Since 
the  length  of  segment  AB  is  Lc one,  the  number  of  steps  of 
such  a  path  is  at  most  equal  to  Lc one  divided  by  the  difference 
in  X  coordinates  of  any  two  consecutive  P/’s,  i.e.,  |~  Jf°ne  ]. 

—  Pstep 

Since  Q  has  a  finite  diameter,  there  can  be  at  most  rdli/m(0)l 
cone  sectors  in  any  cone  sector  sequence.  Thus,  the  upper 
bound  (13)  is  established.  ■ 

We  now  state  two  additional  results  needed  to  prove  Theo¬ 
rem  III. 3. 


Proposition  IV.5  (Maximum  evasion  angle)  If  the  pursuer 
uses  the  GREEDY  strategy  and  if  ( n 2  —  l)r2t  >  s2[t],  where 
s[t]  =  ||p[i]  —  e[f]||,  then  define 


Ana x[t]  ■=  arccos 


-lh2tep-S2M 

2s[f]rstep 


(14) 


If  at  some  time  t,  /3[t]  >  /3max[t\,  then  the  pursuer  moves 
towards  e[t  +  1]  and  traps  the  evader. 

This  result  is  obtained  by  applying  the  cosine  rule  to 
Ap[t]e[f]e[t+1],  where  the  notation  A  ABC  stands  for  triangle 
formed  by  points  A ,  B  and  C,  as  shown  in  Figure  15.  . 


Fig.  15.  Constraint  on  maximum  evasion  angle.  The  dotted  circle  represent 
the  evader’s  motion  disc.  The  circle  centered  at  p[t]  (shown  partially  here)  is 
the  pursuer’s  sensing  range. 


Lemma  IV.6  (Constraint  on  maximum  evasion  angle) 

For  the  evader  to  move  out  of  a  cone  sector  sequence 
Ck,  described  in  Definition  IV.3,  there  exists  a  time  instant 
t  £  [tk ,  tk- pi  [  (ref.  Definition  IV.3 )  such  that 

d5) 

where  N*  is  defined  in  Proposition  IV.4. 

Proof:  For  the  evader  to  move  out  of  the  cone  sector 
sequence  Ck,  the  sum  of  the  angles  of  deviation  for  the  pursuer 
must  exceed  half  of  the  cone  angle  6,  i.e., 

tk+i  a 

Ei«wi>  o- 

t-tk 

Geometrically,  this  condition  implies  that  the  angle  between 
the  vectors  e[tk+ 1]  —p[tk+ 1]  and  e[tk]  —  p[tk]  must  be  at  least 
|.  This  is  illustrated  in  Figure  16.  From  Proposition  IV.2,  it 
implies  that 

tk+i  „ 

E  \^\  >  o- 

t=tk 

Equation  (15)  now  follows  from  the  fact  that  tk+ 1  —tk<  N*, 
for  every  k,  since  there  exists  a  maximum  number  of  time 
steps  N*  for  which  the  evader  can  remain  inside  any  cone 
sector  sequence,  as  derived  in  Proposition  IV.4.  ■ 

We  are  now  ready  to  prove  Theorem  III.  3. 

Proof  of  Theorem  III. 3:  Two  cases  need  to  be  considered: 
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Fig.  16.  Illustrating  Lemma  IV.6.  This  is  a  case  of  the  evader  moving  out 
of  the  cone  sector  sequence  C k  by  moving  out  of  Ck, o- 


(i)  Evader  does  not  move  out  of  a  cone  sector  sequence'. 
Capture  follows  from  the  construction  of  the  cone  sector 
sequence  and  from  Proposition  IV.4. 

(ii)  Evader  moves  out  of  a  cone  sector  sequence'.  In  this 
case,  we  seek  to  show  that  the  evader  cannot  keep  moving 
out  of  an  arbitrarily  large  number  of  cone  sector  sequences.  If 
the  evader  leaves  the  cone  sector  sequence  Ck,  then  for  some 
T  e  { tk , . . .  ,tk+ 1  -  l},/3 [t]  >  /3C.  Applying  the  cosine  rule 
to  Ap[r]e[r]e[r  +  1],  we  obtain 


s2[r  +  l]  =  rs2tep  +  (s[r]-rstep)2  +  2rstep(s[r]-rstep)cos/3[r], 
=>  s2  [t]  —  s2  [t  +  1]  =  2rstep(s[r]-rstep)(l-cos/3[r]). 

Using  equation  (15)  and  since 


s[t]  +  s[r  +  1]  <  2«Xstep, 


we  obtain 


r  ,  i i  .  ,  (I-COS(^)) 

3[r  +  1J  -  rstep  <1 - — - 


(s[r] -rstep).  (16) 


Defining  \k  '■=  s[A]  —  rstep,  we  conclude  that 
Xfc+i  <  s[r  +  1]  -  rstep, 

(l-cos(^r)) 


<  1- 


(s[t]  -  7-step), 


<  1- 


(!  -  COs(^r)) 


Xk, 


(17) 


where  the  first  and  third  inequalities  follow  from  the  fact  that 
distance  s[t]  is  non-increasing  in  the  Greedy  strategy  and 
the  second  inequality  follows  from  equation  (16).  Recall  that 
n  >  1  by  assumption  and  hence  the  term  in  the  parenthesis  is 
positive  and  strictly  less  than  1.  Thus,  \k  — ►  0  asymptotically, 
i.e.,  the  distance  between  the  pursuer  and  evader  tends  to  7"step 
asymptotically.  Moreover,  for  k  >  2,  the  distance  reduces  to 
(k  —  l)rsteP  after  a  finite  time.  Thus,  the  motion  disc  of  the 
evader  will  become  completely  contained  within  the  sensing 
disc  of  the  pursuer.  Hence,  the  result  follows. 

The  case  of  n  =  2:  We  have  seen  that  the  distance  s[f] 
between  the  pursuer  and  evader  tends  asymptotically  to  rstep. 
From  Proposition  IV.5,  we  obtain  that  as  s[f]  — >  rstep,  the  angle 
Anax  — >  0.  So,  after  some  finite  time. 


Anax  <  2^\f*  — '  @c' 

Thus,  the  evader  becomes  confined  to  the  present  cone  sector 
sequence  according  to  Lemma  IV.6  and  from  Proposition  IV.4, 


and  we  can  see  from  part  (i)  of  this  proof  that  the  pursuer  traps 
the  evader  within  its  sensing  radius. 

If  n  <  2:  We  have  seen  that  at  each  time  step  t,  there  is 
a  maximum  value  /3max[t]  of  the  evasion  angle  /3[t],  so  that 
the  evader’s  next  step  e[t  +  1]  is  not  in  the  pursuer’s  sensing 
disc  centered  at  p[t\.  This  is  shown  in  Figure  17.  The  key 
idea  of  this  part  of  the  proof  is  that  if  we  ensure  that  for  all 
subsequent  times  after  a  certain  time  t* ,  Anax[£]  is  less  than 
the  minimum  value  A  (cf.  Lemma  IV.6)  needed  for  the  evader 
to  leave  a  cone  sector  sequence,  then  the  evader  is  forced 
to  remain  inside  a  final  cone  sector  sequence  and  trapping 
follows  from  part  (i).  In  previous  cases,  we  have  seen  that  the 


Fig.  17.  Illustrating  parameters  in  Equation  (18).  e'[t*]  is  a  point  such  that 
||p[t*]  —  e' [f*]  |  =  r^p.  7  is  the  value  of  the  maximum  evasion  angle  if  the 
evader  were  at  e,[t*].  The  circle  of  radius  ftrstep  around  the  pursuer  (shown 
partially  here)  is  the  pursuer’s  sensing  disc  which  the  dotted  circle  around 
e[t*]  is  the  evader’s  motion  disc. 


Greedy  strategy  reduces  the  distance  s[f]  asymptotically  to 
v step .  Thus  after  a  finite  time  t* ,  s[t*}  attains  a  value  such  that 
the  maximum  evasion  angle  is  less  than  or  equal  to  (1  +  6) 7, 
where  7  is  the  maximum  evasion  angle  if  the  evader  is  at  e'[f*], 
which  satisfies  ||p[f*]  —  e'[f*]||  =  rstep  and  6  is  a  given  positive 
number.  At  this  time  instant  £fmai ,  let  the  pursuer  construct  a 
new  cone  sector  sequence,  Ciinai-  So,  if 

(1  +  S)"fN*  =  (18) 

where  N*  and  9  are  defined  in  Proposition  IV.4  and  in 
the  definition  of  a  cone  respectively,  then  for  some  r  £ 
{Anal,  •  •  • ,  Anal  +  N*},  (3[t\  >  (1  +  <5)7  =  A  for  the  evader  to 
leave  (Anal-  This  means  that  the  evader  is  forced  to  step  inside 
the  sensing  disc  of  the  pursuer  or  to  remain  inside  the  final 
cone  Cfinai.  In  both  cases,  the  pursuer  traps  the  evader  within 
its  sensing  radius.  From  equation  (18), 


Applying  the  cosine  rule  to  +  1], 

n  =  yj 2  +  2  cos  7  >  \/ 2  +  2  cos  A- 

Thus,  we  have  shown  that  if  n  >  y/2  +  2  cos  A,  then  the 
pursuer’s  Greedy  strategy  guarantees  that  the  evader  is 
trapped. 

Computing  an  upper  bound  on  the  trapping  time'.  We  have 
seen  that  when  the  pursuer  uses  the  Greedy  strategy,  the 
evader  cannot  leave  an  arbitrarily  large  number  of  cone  sector 
sequences.  Thus,  to  compute  an  upper  bound  on  the  trapping 
time,  we  compute  an  upper  bound  on  the  number  of  cone 
sector  sequences  that  the  evader  can  leave.  We  have  seen 
that  using  the  Greedy  strategy,  /3max  <  A  after  finite  time. 
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From  (14),  we  can  determine  that  distance  sc  for  which 
Anax  =  A ,  so  that  subsequently,  the  evader  is  confined  to 
the  final  cone  sequence: 


Sc 


COS  A  )  C step  • 


If  k  is  the  final  cone  sequence  index,  then  using  equation  (17), 


Sc  -  r step  <  Xk  <  Ax'fc-1  <  •  •  •  <  Afc(«  -  1 ) f step ; 

where  A  =  1— 1  co^ 2W* -1  and  the  worst-case  xo  =  (k—  l)rstep. 
Upon  simplifying,  we  obtain 


k  < 


i  /  \/^2  —  sin2  /3C— cos  /3C— 1 

log  (  * - — - 

l°g(A) 


The  result  now  follows  from  the  fact  that  for  the  case  of  k  <  2, 
we  construct  an  extra  final  cone  sequence  and  the  maximum 
number  of  steps  in  each  cone  sequence  can  be  at  most  N*. 
The  asymptotic  result  follows  by  routine  simplifications.  ■ 
Proof  of  Proposition  III. 4:  We  prove  this  result  by  determining 
a  set  of  initial  conditions  and  an  evader  strategy  that  leads  to 
evasion.  Suppose  at  time  t,  the  pursuer  and  the  evader  are  on 
a  circle  U  with  radius  p  =  7j'ep  2 ,  such  that  ||e[t]  —  p[t]  ||  = 
rstep  as  shown  in  Figure  3.  The  evader  is  not  trapped  as  its 
motion  disc  is  not  completely  contained  inside  the  pursuer’s 
sensing  disc.  An  evader  strategy  is  to  choose  a  point  e[t  +  1] 
on  such  that  ||e[f]  —  e[t  +  1]||  =  rstep.  Since  p  =  , 

e[t  +  1]  lies  outside  the  pursuer’s  sensing  disc  before  its  move 
at  time  t  +  1.  By  the  Greedy  strategy,  p[t  +  1]  =  e{t\.  Thus, 
||e[f  +  1]  —  p[t  +  1]||  =  rstep  and  the  evader  can  avoid  getting 
trapped.  ■ 


B.  Multiple  pursuer  problem 

We  first  state  a  property  of  the  effective  length  of  the 
trapping  chain. 


Proposition  IV.7  The  effective  length  of  the  trapping  chain 
satisfies 


2(V4k2  -  25(AT  -  3)  —  7 Tft) 

7 r 


l 

<  -  < 

^step 


\J 4k2  -  257V  -  2k. 


Proof:  The  left  hand  side  of  the  inequality  follows  from 
the  fact  that  the  circumference  of  the  circle  passing  through 
the  vertices  of  the  trapping  chain  is  greater  than  the  sum  of  the 
distances  of  neighboring  vertices.  The  right  hand  side  follows 
from  repeated  use  of  the  triangle  inequality.  ■ 

Proof  of  Theorem  III.  7:  Let  the  evader  be  located  at  a  point 
Ye  £  Q  and  let  its  distance  from  the  edge  AB  be  y,  as  shown 
in  Figure  18.  Note  that  the  distance  of  the  evader  from  the 
edge  AD  does  not  play  any  role  in  what  follows.  The  main 
idea  behind  this  proof  is  as  follows:  Ye  would  lie  in  the  shaded 
region  in  Figure  18  if  (o  +  2r-sens  <  y  or  lo  —  2rsens  >  y.  This  is 
equivalent  to  choosing  a  length  equal  to  the  effective  length  l 
of  the  trapping  chain  from  a  total  length  l  +  4rsens,  if  we  let  Iq 
take  a  uniformly  random  value  from  [— 2rsens,  Z  +  2rsens].  Thus, 
the  probability  of  success  for  the  pursuers  is  at  least  the  ratio 
of  l  to  l  +  4rsens- 


To  be  more  specific,  let  the  spatial  probability  density 
of  the  Y  coordinate  of  the  evader  inside  Q  be  p(y).  Thus, 
./(jll;Un  ^  P('y)dy  =  1.  The  probability  that  the  evader  is  detected 
inside  the  capture  region  S  is  given  by 


P(e  £  S) 


P(e  £  iS|/q  =  k)p(lo  =  k)dlo, 


where  k  £  [— 2rsens,  Z  +  2rsens]-  Assuming  the  pursuers  have  no 
information  about  the  evader’s  location  inside  Q,  Iq  is  chosen 
uniformly  randomly  from  [— 2rsens,  l  +  2rsens].  Hence,  we  have 


/£  +  2rSens  1 

P(e  £  S\Iq  =  k)- - - - dlo 

-2rsens  l  +  4rsens 

1  W+2rsens  Wo+2rsens+i7(Z+2rsens)+J 

(J2  I  p{y)dv 


l  +  4r, 

+ 


sens  J  —  2rsens  j=0  'Ao+2rsens+j(t+2rsens) 

/>diamC/ 

f  p{y)dy)dlo, 

Zo+2rSens  +  ^(^+4 

T  sens  )  +  * 


where  n  :=  [diam Q/{1  +  4rsens)]  is  the  number  of  times 
the  pursuers  sweep  to  clear  the  entire  environment.  Since  the 
variables  Iq  and  y  are  independent  (cf.  Figure  19  for  the  region 
of  integration),  changing  the  order  of  integration  gives 


P{e  €  S)  > 


n~1  r(j+l)(lo+4rm) 

J2  I  p(y)f(y,i)dy 


l  +  4rsens  j_Q  Jj(l0+4rsens ) 

1  pn(l0-h4rsem ) 


l  +  4rsens  J o 


p(y)f{yj)dy, 


where 


f(y,  0 


ry—^'si 
^  2rsens 
ry  2T*sens 
<^y-2rsens—l 


s  dlo  +  /, 

(Uq 


y+2rse 


;  J7 

din 


for  y  <  /, 
otherwise. 


In  both  cases,  f(y ,  l)  =  l.  Thus,  the  minimum  probability 
of  success  is  1/(1  +  4rsens),  since  J^do+4rsa“')  p(y)dy  = 

/0diama  p(y)dy  +  /d”ame4rs“s)  p(y)dy>  and  p(y)  =  °  outside 
of  Q.  The  second  inequality  in  part  (i)  follows  by  use  of  the 
left  hand  inequality  in  Proposition  IV.7.  The  reason  why  this  is 
a  lower  bound  on  the  required  probability  is  that  if  the  pursuers 
had  some  information  about  the  evader’s  location,  then  they 
could  choose  Iq  randomly  from  a  smaller  interval  than  the 
current  one  and  thus  increase  the  probability  of  detecting  the 
evader  in  the  capture  region. 

From  the  Sweep  strategy,  the  width  of  each  strip  swept 
is  l  +  4r sens*  So  the  maximum  number  of  strips  after  which 
the  sweep  phase  terminates  is  ^  takes  at  most 

diam(g)+2(;+2)-sem)  steps  fQr  pursuers  tG  dear  a  strip 
followed  by  aligning  themselves  parallel  to  the  adjacent  strip. 
The  result  now  follows  using  Proposition  IV.7.  ■ 

To  prove  Theorem  III.9,  we  first  establish  the  following 
properties  of  a  trapping  chain.  In  what  follows,  given  points 
a,b,c  £  K2,  the  notation  dist(a,  be)  is  the  distance  of  point  a 
from  the  line  be. 


Lemma  IV.8  (Trapping  chain  properties)  If  e[t)  £  S\l), 
then  the  following  statements  hold: 
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Fig.  18.  Illustrating  the  proof  of  Theorem  III. 7. 


Fig.  19.  The  region  of  integration  in  determining  P(e  G  S)  in  the  proof  of 
Theorem  III. 7.  Fix  a  value  of  lo  in  the  interval  (— 2rsens,J  +  2rsens)  to  get 
the  values  of  y  that  correspond  to  P(e  G  <S). 


(i)  If  dist(e[t},pj[t]pJ+1[t})  >  §rstep,  for  all  j  £ 
{1,...,N  —  1},  then  the  evader  cannot  step  out¬ 
side  Co{pi[f], . . .  ,pAr[i]}  at  time  t  +  1  by  crossing 
Pj[t]Pj+i[t]- 

(ii)  If  dist{e[t],pj[t\pj+i[t})  <  |rstep  or  dist(e[f  + 
1]>Pj[t\Pj+ iM)  <  Irstep,  for  some  j  £  {1, . . . ,  N  -  1}, 
then  the  evader  is  trapped  within  the  sensing  radii  of 
pursuer  pj  or  Pj+i  or  of  both  pj  and  Pj+i. 

(iii)  There  exists  a  tf  >  0,  independent  of  N,  such 

that  for  every  point  q  £  Uje{3,...,Lf  J+i}  ®Pi(rsem)  O 

Co{p2,...,PN- i}, 

Zqp2pN-i  >  </>, 

Proof:  Parts  (i)  and  (ii)  follow  trivially  from  the  defini¬ 
tions  of  trapping  within  sensing  radii  and  from  the  construction 
of  the  trapping  chain.  For  part  (iii),  we  can  see  that  for 
j*  =  [Aj  +  1  if  q  (in  the  specified  set)  is  the  point  of 
intersection  of  the  tangent  from  p2  to  the  sensing  disc  of  pj », 
then  the  angle  Zqp2PN-i  is  minimized.  This  follows  from 
the  fact  that  the  line  p2Pj*  is  parallel  to  p:>,  —  pp~ i.  This 
angle  is  minimum  when  N  =  5.  Thus,  given  a  n  >  4,  from 
trigonometry,  we  obtain 

7T  (  K  \ 

6  = - arctan  :  . 

4  V  \/2>n2  -  25^ 


The  use  of  the  ClRCUMCENTER  strategy  in  the  pursuit 
phase  and  the  geometry  of  the  trapping  chain  gives  us  the 
following  result. 

Lemma  IV.9  If  the  evader  is  trapped  within  the  union  of 
the  sensing  radii  of  pursuers  at  time  taiip,  for  every  j  £ 
{1, . . . ,  N  —  1},  then 

dist(e[ftrap],  Pj  [£trap]Pj+l  [^trap]  )  ^  * 

Proof:  Let  the  evader  be  trapped  at  time  fmip  in  the 
sensing  radii  of  the  pursuers.  From  part  (ii)  of  Lemma  IV.8, 
at  time  itraP  —  1  and  for  every  j  £  {1, . . . ,  N  —  1}, 

3 

dist(e[ftrap  1  ]  •  Pj  fLiap  l]Pj-(-l  [^trap  4])  ^  ^stcp • 

Thus,  immaterial  of  where  the  evader  decides  to  step,  its 
distance  from  Pj[ftrap  —  l]pj+i[ftraP  —  1]  is  greater  than 
Two  cases  are  possible: 

(a)  the  evader  steps  inside  the  sensing  disc  of  some  pursuer 

Pj.-  There  are  two  further  possibilities.  If  dist(e[ftrap],Pj[f:tiap  — 
l]pj+i[ftraP  —  1])  <  4 f step 5  then  the  evader  is  trapped  by 
part  (ii)  of  Lemma  IV.8  and  the  present  lemma  is  proven. 
Else,  we  have  dist(e[ftlaP],Pj[ftrap  -  l]pj+i[4-aP  -  1])  >  f^step- 
Now,  pj  uses  part  (ii)  of  the  ClRCUMCENTER  strategy.  Even 
in  this  case,  dist(e[ftrap],pj[ftrap]pj+i[ftrap])  >  ,  for  every 

j  £  {1,...,N  -1}. 

(b)  the  evader  steps  outside  the  sensing  disc  of  every 
pursuer:  In  a  trapping  chain,  the  overlap  between  the  sensing 
discs  of  any  two  neighboring  pursuers  has  the  property  that 
length  of  the  common  chord  of  these  discs  is  greater  than 
|fstep-  This  means  that  even  if  any  two  neighboring  pursuers 
Pj  and  Pj+\  happen  to  move  parallel  to  each  other,  we  have 

dist  (e[ftrap] ,  Pj  [ftraplPj+l  [^trap])  ^  ® 

We  now  present  the  proof  of  Theorem  III. 9. 

Proof  of  Theorem  III.  9: 

We  first  look  at  a  case  in  which 

dist(e[f  sweep] :  Pj  [fsweep]Pj+l  [^sweep]  )  '  2^* step  for  Some 

j  £  {1, . . . ,  N—  1}.  In  this  case,  the  evader  is  already  trapped 
within  the  sensing  radii  of  the  pursuers,  from  part  (ii)  of 
Lemma  IV.8  and  the  result  holds. 

NOW  let  dist(e[f sweep] ,  Pj  [fsweep]Pj+l  [fsweep])  ^  fol 

every  j  £  {1, . . . ,  N  —  1}.  There  are  two  possibilities:  if  e[f  + 
1]  €  <Se[£],  for  any  t  >  f sweep,  then  there  is  a  pursuer  pj  for 
which  ye[t  +  1]  =  e[t  +  1].  This  pursuer  uses  part  (ii)  of 
the  ClRCUMCENTER  strategy  and  the  evader  is  trapped  within 
the  sensing  radius  of  pj .  Part  (iii)  of  the  result  follows  using 
Lemma  IV.9. 

So,  let  c [t sweep  +  1]  N[£SWeep]-  Now,  the  pursuers  com¬ 
pute  the  circumcenter  O  of  Ap2[f sweep] e[fsweepW-i[f sweep]- 
Lemma  IV.8  implies  that  the  evader  cannot  step  out  of  the 
pursuers’  convex  hull  by  crossing  line  pj[t\pj+i[t\,  for  any 
j  £  {1,...,N  —  1}.  Thus,  it  suffices  to  show  that  the 
evader  cannot  leave  the  pursuers’  convex  hull  by  crossing  line 
Pi  [£]ptv  [£]  ■  Li  fact,  we  show  that  at  the  end  of  every  pursuer 
move,  the  evader  remains  on  the  same  side  of  P2PN-1  until 
it  gets  trapped.  We  argue  this  as  follows.  As  illustrated  in 
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Figure  20,  any  point  on  lines  p2  [iswccp]  O  and  pjv-i[4weep]0  is 
reached  faster  by  p2  and  pn-i  respectively  than  by  the  evader. 
Thus,  the  motion  of  the  evader  is  confined  to  the  convex  hull  of 
{ O , P2 ,  ■  •  •  which  reduces  to  the  point  O  in  a  number 

of  time  steps  upper  bounded  by 


max 

je{2,...,Ar— 1} 


!  Pj  [f  sweep] 


'  step 


which  is  essentially  the  time  taken  by  the  furthest  pursuer  to 
reach  O.  Thus, 


t 


trap  ^ 


R  T  l  T  2ftrstep 

Tstep 


where  R  denotes  the  circumradius  of 
Ap2[f sweep] e[tSweepW-i[f sweep]-  From  elementary  geometry, 
at  time  fsweep  we  have 


R  =  \\P2  ~  ell lbiv-i  ~  e\\\\P2  -Pn-i\\ 
4Area(Ap2epjv-i) 

/  T  2Krstep 

“  2sinZep2pjv-i  ’ 

^  1  -f  2  KTslcp 

—  2  sin  <f>  ’ 


where  the  second  and  third  inequalities  follow  from  part  (iii) 
of  Lemma  IV.8.  Thus,  part  (ii)  of  the  theorem  follows  from 
the  use  of  right  hand  inequality  in  Proposition  IV.7. 


Fig.  20.  A  move  of  the  ClRCUMCENTER  strategy.  The  evader  is  confined 
to  the  shaded  region.  The  circles  around  the  pursuers  represent  their  sensing 
discs. 


To  prove  part  (iii),  recall  that  pursuers  pi  and  /y.y  move 
parallel  to  p2  and  Pjv-i>  respectively.  Since  the  evader  remains 
inside  the  convex  hull  of  pursuers  p2, . . . ,  p,v_i ,  the  distance 
of  the  evader  from  line  piPn  is  always  greater  than  Api,  until 
it  gets  trapped.  From  this  fact  and  Lemma  IV.9,  part  (iii)  now 
follows.  ■ 

Proof  of  Theorem  III. 10:  Part  (i)  follows  from  the  PLANES 
strategy.  Thus,  once  the  evader  is  trapped,  it  remains  trapped  at 
all  successive  time  instants  when  the  pursuers  use  the  PLANES 
strategy.  Thus,  the  problem  is  reduced  to  one  with  unlimited 
sensing  for  the  pursuers.  To  show  that  the  algorithm  leads  to 
capture  in  finite  time,  we  refer  the  reader  to  [5]. 

We  now  determine  an  upper  bound  on  the  time  taken  for 
the  capture  phase  in  terms  of  the  trapping  chain  parameters. 
Referring  back  to  the  proof  of  correctness  of  the  PLANES 


strategy,  since  the  evader  is  in  the  convex  hull  of  the  pursuers, 
let  Vj  denote  vectors  of  magnitude  rstep  in  the  direction  of 
Pj  trap ]  —  ^ [f trap]  •  We  now  wish  to  seek  a  lower  bound  on  the 
radius  e  of  the  largest  circle  centered  at  the  origin  that  can 
be  inscribed  inside  the  convex  hull  of  the  vectors  Vj.  This 
is  equivalent  to  determining  what  is  the  largest  of  the  angles 
[itrap]  e  [Lrap]fb  [brap]  •  Due  to  the  property  (11)  and  to  the 
fact  that  the  distance  between  any  two  adjacent  pursuers  in  the 
trapping  chain  is  non-increasing  during  the  ClRCUMCENTER 
strategy,  the  angle  [f  lrap]  e  [f trap]p7  [f trap]  is  the  greatest  when 

i  and  j  are  adjacent  and  the  evader  is  equidistant  from  both 
of  them  and  at  a  distance  of  r-ffL  from  pt  [ftrap]p?  [Lrap]  •  This 
is  shown  in  Figure  21.  This  gives,  e  =  rstep  r^2  = 

Now,  following  [5],  we  observe  that  there  exist  three  pursuers 
which  contain  the  evader  within  their  convex  hull  at  the  end 
of  the  pursuit  phase,  such  that  the  sum  of  the  distances  of 
these  pursuers  to  the  evader  decreases  by  at  least  |  at  the  end 
of  every  pursuer  move.  The  result  now  follows  from  the  fact 
that  the  distance  between  any  one  of  the  three  pursuers  and  the 
evader  is  at  most  l  +  2nrstep  at  the  end  of  the  pursuit  phase  and 
the  use  of  the  right  hand  side  inequality  in  Proposition  IV.7. 


Fig.  21.  Illustrating  proof  of  Theorem  III.  10.  Given  the  evader  to  be  at  a  dis¬ 
tance  greater  than  Ap  from  the  line  PiPj.  the  angle  Z.pi  [ftrap]e[ftrap]py  [ftrap] 
is  the  greatest  when  the  evader  is  equidistant  from  both  of  them 


V.  Simulation  Results 

We  now  present  simulation  studies  to  investigate  the  ro¬ 
bustness  of  the  algorithms  to  sensing  errors.  We  study  the 
performance  of  the  algorithms  in  several  cases  such  as  differ¬ 
ent  sensing  to  stepping  radius  ratio  and  faster  evader.  We  also 
study  the  case  of  different  number  of  pursuers  in  the  multiple 
pursuer  problem.  All  simulations  were  run  using  MATLAB®. 

In  the  context  of  sensing  errors,  we  assume  two  types  of 
error  models: 

Gaussian  errors:  We  assume  zero-mean  additive  Gaussian 
measurement  errors  in  the  position  of  the  evader  with  a 
standard  deviation  given  by 

<7j[t]  =4Pj[t\  -e[t]||, 

for  some  constant  e  >  0.  This  means  that  the  uncertainty  in 
the  location  of  the  evader  increases  with  its  distance  from  a 
pursuer.  The  evader  is  defined  to  be  captured  if  the  probability 
of  the  evader  being  inside  the  motion  disc  of  the  pursuer  before 
the  pursuer’s  move  is  more  than  a  certain  threshold.  In  other 
words,  for  some  t  and  for  some  pursuer  p3 , 

^[fi(^befW)  C  -  1]), 

where  Baj  [t]  (j/bef  M )  denotes  the  circular  region  of  radius  aj  [t] 
centered  at  y^e f[t]. 
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Non-Gaussian  errors :  The  measured  distance  is  given  by 
(l  +  e*)||pj[i]  —  e[i]||,  where  e*  is  a  random  variable  uniformly 
distributed  in  the  interval  [— e,  e],  where  e  >  0  is  the  specified 
error  parameter.  With  respect  to  angular  measurements,  if  9a 
is  the  actual  angular  location  of  the  evader  with  respect  to  a 
local  reference  frame  of  a  pursuer,  then  the  measured  angular 
location  is  given  by  9a  +  eg,  where  eg  is  a  random  variable 
uniformly  distributed  in  the  interval  [—AO,  AO],  where  the 
value  of  A 9  was  chosen  to  be  1  degree.  The  evader  is  captured 
in  this  model  if  y^Jt]  e  Bracp{pj[t  -  1]). 

A.  Single  pursuer  problem 

Under  the  considered  noisy  sensor  models: 

•  The  Sweep  strategy  remains  unchanged.  It  terminates 
when  an  evader  measurement  is  available. 

»  For  the  Greedy  and  Lion  strategies,  the  pursuer  uses 
the  noisy  measurements  of  the  evader  position  instead  of 
the  true  position  e[t\  to  compute  its  next  position. 

For  the  evader’s  motion,  we  assume  that  it  moves  away  from 
the  pursuer  with  some  randomization,  while  avoiding  the 
boundary.  Specifically, 

•  if  the  evader  is  not  close  to  the  boundary  of  the  environ¬ 
ment,  then  it  chooses  to  move  to  a  point  on  its  motion 
circle,  selected  uniformly  randomly  in  a  sector  with  angle 
0.2  radians.  This  sector  is  placed  symmetrically  along  the 
line  e[t]p[t]  and  away  from  the  pursuer. 

•  If  the  boundary  is  visible  to  the  evader,  then  it  chooses 
to  move  to  a  point  e[t  +  1]  on  its  motion  circle  such  that 
Ze[t  +  l]e[i]p[i]  =  n  —  0.2.  Of  the  two  points  possible, 
the  evader  chooses  that  point  which  is  further  away  from 
the  boundary.  In  other  words,  when  the  evader  reaches  the 
boundary,  it  chooses  to  move  to  a  point  that  is  away  from 
the  pursuer  as  well  as  not  very  close  to  the  boundary. 

For  our  simulations,  the  environment  is  a  circle  with  diameter 
40  units.  We  assume  k  =  5  units  and  rstep  =  1  unit.  The  initial 
position  of  the  evader  was  chosen  uniformly  randomly  in  the 
environment.  An  upper  limit  of  2,  000  time  steps  was  set  to 
decide  whether  the  strategy  terminated  in  a  success. 

The  following  is  a  summary  of  our  findings: 

(i)  Performance  of  the  strategy  with  noisy  measurements: 
The  plots  of  probability  of  success  of  the  strategy  and  average 
capture  times  after  detection  (given  that  the  strategy  terminates 
with  capture)  for  both  noise  models  versus  the  respective  error 
parameters  e  are  shown  in  Figure  22.  We  observe  a  similar 
trend  in  the  performance  of  the  strategy  in  both  noise  models. 

(ii)  Different  sensing  to  stepping  radius  ratios:  We  repeated 
the  simulations  for  the  cases  of  the  ratio  of  sensing  to  stepping 
ratio  k  =  7  and  n  =  10.  We  present  the  variation  of  probability 
of  success  in  the  Gaussian  noise  model  in  Figure  23. 

(iii)  Faster  evader:  We  repeated  the  simulations  for  the  case 
of  faster  evader.  Assuming  no  noise,  we  present  the  variation 
of  the  probability  of  success  in  the  top  part  of  Figure  24.  We 
observe  that  when  the  evader  is  at  least  3/2  times  that  of  the 
pursuer,  the  proposed  pursuer  strategy  is  not  efficient. 

B.  Multiple  pursuer  problem 

Under  the  considered  noise  models: 


Fig.  22.  Effect  of  measurement  noise  in  the  single  pursuer  problem.  For  a 
particular  evader  strategy,  we  study  how  the  capture  probability  and  average 
capture  time  given  that  the  strategy  succeeds,  vary  with  the  noise  parameter  e, 
under  Gaussian  and  Non-gaussian  error  models.  In  the  top  figure,  an  interval 
of  d=0.1  (not  shown  to  preserve  clarity)  about  the  probability  estimates  is 
the  95%  confidence  interval  given  by  P(e)  —  2  yj  ^5,  P{e)  +  2  yj  , 
where  n  =  100  is  the  number  of  trials  [23].  In  the  bottom  figure,  the  vertical 
bars  give  a  95%  confidence  interval  about  the  average  capture  time  T(e) 

which  is  given  by  [t(c)  -  2 y/ §p^y,T(e)  +  2^J ,  where  SD{e)  is 
the  standard  deviation  in  the  capture  time,  P(e)  is  the  estimated  probability 
of  success  and  n  =  100  is  the  number  of  trials  [23]. 


K=10 


Fig.  23.  Effect  of  varying  the  sensing  to  stepping  radius  ratio  k  in  the  single 
pursuer  problem.  For  a  particular  evader  strategy,  we  study  how  the  capture 
probability  varies  for  n  =  7  and  ft  =  10  with  the  noise  parameter  e,  under 
Gaussian  noise  model.  The  error  bars  are  in  accordance  with  Figure  22. 

•  The  Sweep  strategy  remains  unchanged.  It  terminates 
when  an  evader  measurement  is  available. 

.  For  the  ClRCUMCENTER  and  PLANES  strategies,  the 
team  of  pursuers  use  the  average  of  the  available  evader 
measurements  yt_i  [t—  1]  and  yt_|  [f],  to  compute  their  next 
positions. 

For  the  sake  of  simulations,  we  assume  N  =  7  pursuers 
with  k  =  5  units  and  rstep  =  1  unit.  We  assume  a  square  field 
of  edge  length  100  units,  where  the  evader  is  initially  placed 
at  a  uniformly  randomly  selected  location.  Upon  detection,  we 
assume  that  the  evader  moves  away  from  the  closest  pursuer 
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Single  pursuer  problem 


Evader/pursuer  speed  ratio 


Fig.  24.  Performance  of  Sweep-Pursuit-Capture  strategy  against  a  faster 
evader.  For  a  particular  evader  strategy,  we  study  how  the  capture  probability 
varies  for  higher  evader  speeds,  assuming  no  measurement  noise.  The  top 
figure  presents  the  single  pursuer  case  and  the  bottom  figure  presents  the 
multiple  pursuer  case.  The  error  bars  are  in  accordance  with  Figure  22. 


Fig.  25.  Effect  of  measurement  noise  in  the  multiple  pursuer  problem.  For 
a  particular  evader  strategy,  we  study  how  the  capture  probability  (top  figure) 
and  average  capture  time  (bottom  figure)  given  that  the  strategy  succeeds,  vary 
with  the  noise  parameter  e,  under  Gaussian  and  Non-gaussian  error  models. 
The  error  bars  are  in  accordance  with  Figure  22. 


with  some  randomization.  Specifically,  it  moves  to  a  point  on 
its  motion  circle,  selected  uniformly  randomly  in  a  sector  of 
angle  equal  to  0.2  radians.  This  sector  has  its  vertex  at  e[t\  and 
angle  bisector  parallel  to  the  line  e[fsweep]0,  where  fsweep  is  the 
time  when  the  evader  is  detected  and  O  is  the  circumcenter  of 
the  triangle  p2  [fsweep],  Pe[t sweep]  and  e[fsweeP]-  We  study  how 
the  average  capture  time  after  detection  varies  with  e.  An  upper 
limit  of  1000  time  steps  was  set  to  decide  whether  the  strategy 
terminated  in  a  failure. 

The  following  is  a  summary  of  our  findings: 

(i)  Performance  of  the  strategy  with  noisy  measurements: 
The  plots  of  probability  of  success  of  the  strategy  and  average 
capture  times  after  detection  (given  that  the  strategy  terminates 
with  capture)  for  both  noise  models  versus  the  respective  error 
parameters  e  are  shown  in  Figure  25.  We  observe  a  similar 
trend  in  the  performance  of  the  strategy  in  both  noise  models. 

(ii)  Different  number  of  pursuers:  We  repeated  the  simula¬ 
tions  for  the  cases  of  the  number  of  pursuers  N  =  10  and 
N  =  15.  We  present  the  variation  of  probability  of  success  in 
the  Gaussian  noise  model  in  Figure  26. 

(iii)  Faster  evader:  We  repeated  the  simulations  for  the  case 
of  faster  evader.  Assuming  no  noise,  we  present  the  variation 
of  the  probability  of  success  in  the  bottom  part  of  Figure  24. 
We  observe  that  when  the  evader  is  at  least  1.8  times  that  of 
the  pursuers,  the  proposed  pursuer  strategy  is  not  efficient. 

VI.  Biological  Interpretations 

Our  analysis  in  the  previous  sections  can  shed  light  on  the 
trade-offs  that  predators  face  when  deciding  upon  the  group 
size.  Based  on  our  results  from  Section  III-B,  we  now  study 
how  the  group  size  of  the  pursuers  varies  with  the  evader 
availability  in  the  multiple  pursuer  problem. 

For  simplicity,  we  assume  a  square  field  where  the  evader 
is  initially  located  and  denote  by  8  :=  dian^g^  the  evader 
density.  From  the  results  in  Section  III-B,  an  upper  bound  on 


N  =  10 


Error  parameter  e 

Fig.  26.  Effect  of  varying  the  number  of  pursuers  N  in  the  multiple  pursuer 
problem.  For  a  particular  evader  strategy,  we  study  how  the  capture  probability 
varies  for  N  =  10  and  TV  =  15  with  the  noise  parameter  e,  under  Gaussian 
noise  model.  The  error  bars  are  in  accordance  with  Figure  22. 

the  total  time  taken  by  the  pursuers  in  all  three  phases  of  the 
strategy  is  given  by 

1  cN 

— — - tt"  T  — 7= - T  kN : 

6{aN  +  b)  s/S(aN  +  b) 

where  a  :=  2rftepv/4«;2  —  25/7T,  b  :=  (2ttk  —  6\/Ak2  —  25) /n, 
c  :=  2rstepV/4K2  —  25  and  k  :=  y/An2  —  25(1  +  1/  sin  + 
18k-\/4k2  —  25  are  constants  independent  of  the  number  of 
predators  N  or  the  evader  density  5. 

From  part  (i)  of  Theorem  III. 7,  we  observe  that  when  all 
other  variables  are  kept  constant,  the  lower  bound  on  suc¬ 
cessful  detection  probability  of  the  Sweep  strategy  increases 
with  N.  However,  a  higher  N  results  into  a  greater  time  to 
capture  the  evader.  This  suggests  a  trade-off  on  the  group  size 
N  which  we  analyze  as  follows. 

Let  v  denote  the  nutritional  content  of  the  evader.  We 
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quantify  the  energy  spent  by  each  pursuer  as  the  time  taken 
to  capture  the  evader.  The  energy  gain  from  the  pursuit  is 
quantified  as  the  amount  of  nutrition  each  participating  pursuer 
receives  from  the  evader.  For  a  self-sustaining  pursuit,  we  must 
have  that  the  energy  gained  by  each  pursuer  is  at  least  equal 
to  the  energy  spent  during  the  hunt.  Thus, 

v  1  cN  ,  AT 

_ > _ I - 1_  kN 

N  S(aN  +  b)  \/S(aN  +  b) 

From  this  relation,  we  observe  that  for  a  given  value  of  5, 
there  exists  an  upper  limit  on  the  number  of  pursuers  in  the 
group  for  which  it  is  advantageous  for  the  pursuers  to  engage 
in  a  pursuit  with  the  prospect  of  gaining  energy.  A  plot  of  the 
upper  limit  on  the  group  size  N  versus  the  evader  density  S 
is  shown  in  Figure  27. 

This  analysis  shows  that  for  higher  values  of  S,  a  larger 
number  of  pursuers  can  be  accommodated  in  the  trapping 
chain.  This  is  consistent  with  observations  in  the  biology 
literature  by  Caraco  and  Wolf  [24]  that  have  reported  higher 
group  size  in  foraging  lions  during  the  wet  season  (prey 
abundance),  than  in  the  dry  season,  (prey  scarcity).  Further, 
from  our  analysis,  it  also  follows  that  for  a  given  evader 
density,  the  higher  the  prey  nutrition  value  v,  the  higher  is 
the  upper  limit  on  the  number  of  pursuers  in  the  trapping 
chain.  This  is  consistent  with  the  observations  reported  by 
Griffiths  [25]  regarding  how  the  size  of  hunting  packs  relate 
to  the  size  of  the  prey  relative  to  that  of  the  predators. 


environment  and  moves  only  when  detected.  We  propose  a 
Sweep-Pursuit-Capture  strategy  for  both  problems. 

In  the  first  problem,  we  give  sufficient  conditions  on  the 
range  of  values  taken  by  the  ratio  of  sensing  to  stepping  radius 
of  the  players  so  that  the  Greedy  pursuer  strategy  of  moving 
towards  the  last-sensed  evader  position  leads  to  the  evader 
being  trapped  within  the  pursuer’s  sensing  disc  and  finally 
to  capture.  We  also  give  conditions  under  which  there  exist 
locations  from  which  the  evader  can  escape.  In  the  second 
problem,  we  have  shown  that  the  pursuers  capture  the  evader 
with  a  certain  probability  that  is  independent  of  the  initial 
evader  location  in  a  bounded  region.  We  give  novel  upper 
bounds  on  the  total  time  taken  to  capture  for  both  problems. 
We  also  present  simulation  studies  that  suggest  robustness  with 
respect  to  sensing  errors.  Finally,  on  the  basis  of  the  obtained 
upper  bound  on  the  total  capture  time,  we  provide  an  upper 
bound  on  the  group  size  of  the  pursuers  for  which  the  pursuit 
is  advantageous  from  the  point  of  view  of  gaining  energy. 
The  conclusions  based  on  our  analysis  are  consistent  with 
observations  reported  in  ecology  literature. 

In  the  future,  it  would  be  interesting  to  study  the  effects 
of  communication  losses  or  errors  in  the  multiple  pursuer 
problem.  Another  interesting  direction  for  future  research 
would  be  to  consider  more  sophisticated  sensing  models  for 
the  players. 
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Fig.  27.  Plot  of  maximum  group  size  of  pursuers  that  can  be  sustained  versus 
the  evader  density  8,  for  k  =  5,  rstep  =  1,  v  =  10000. 


VII.  Conclusions  and  Future  Directions 

We  have  addressed  discrete-time  pursuit-evasion  problems 
in  the  plane  with  sensing  capabilities  restricted  to  a  finite  disc. 
We  consider  two  variants  of  the  pursuit-evasion  in  discrete¬ 
time.  The  first  involves  a  single  pursuer  and  an  evader  in  a 
bounded  convex  environment.  The  second  is  a  formation  de¬ 
sign  problem  for  multiple  communicating  pursuers  to  capture  a 
single  evader  in  a  boundaryless  environment.  In  both  problems, 
the  evader  is  initially  located  inside  a  bounded  subset  of  the 
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